Compression of Very Sparse Column Oriented Data
نویسندگان
چکیده
منابع مشابه
Column Stores for Wide and Sparse Data
While it is generally accepted that data warehouses and OLAP workloads are excellent applications for column-stores, this paper speculates that column-stores may well be suited for additional applications. In particular we observe that column-stores do not see a performance degradation when storing extremely wide tables, and column-stores handle sparse data very well. These two properties lead ...
متن کاملVParC: a compression scheme for numeric data in column-oriented databases
Compression is one of the most important techniques in data management, which is usually used to improve the query efficiency in database. However, there are some restrictions on existing compression algorithms that have been applied to numeric data in column-oriented databases. First, a compression algorithm is suitable only for columns with certain data distributions not for all kinds of data...
متن کاملA column-oriented data stream engine
This paper introduces the DataCell, a data stream management system designed as a seamless integration of continuous queries based on bulk event processing in an SQL software stack. The continuous stream queries are based on a predicate-window, called “basket” expressions, which support arbitrary complex SQL subqueries including, but not limited to, temporal and sequence constraints. The DataCe...
متن کاملSequential Data Compression of Very Large Data in Volume Rendering
Improving rendering speed in the visualization of very large volumetric data without loss of information is still a challenge. State of the art methods focus on data reduction by omitting content which does not contribute to the final image. In order to further decrease the amount of data, we present a general approach for applying common sequential data compression schemes to blocks of data, w...
متن کاملDataCommandr: Column-oriented Data Integration, Transformation and Analysis
In this paper, we describe a novel approach to data integration, transformation and analysis, called DataCommandr. Its main distinguishing feature is that it is based on operations with columns rather than operations with tables in the relational model or operations with cells in spreadsheet applications. This data processing model is free of such typical set operations like join, group-by or m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Revista ComInG - Communications and Innovations Gazette
سال: 2016
ISSN: 2448-1904,2448-1904
DOI: 10.5902/2448190422772